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Complete mitogenome of the Lesser Purple Emperor Apatura 
ilia (Lepidoptera: Nymphalidae: Apaturinae) and 
comparison with other nymphalid butterflies 
CHEN Mei’, TIAN Li-Li!, SHI Qing-Hui', CAO Tian-Wen^', HAO Jia-Sheng"" 


(1. College of life Sciences, Anhui Normal University, Wuhu Anhui 241000, China; 2. Institute of Plant Protection, 
Shanxi Academy of Agriculture Science, Taiyuan Shanxi 030031, China) 


Abstract: The complete mitochondrial genome of Apatura ilia (GenBank accession no. JF437925) was determined 


as a circular DNA molecule of 15 242 bp, with common genes of 13 putative proteins, 2 rRNAs, and 22 tRNAs and of the 
same gene arrangement as in other sequenced lepidopterans. All protein-coding genes had the typical start codon ATN, 
except for the COI’s using CGA as its start codon as previously demonstrated in other lepidopteran species. The 
comparison of the nucleotide sequences of the A. ilia mitogenome with ten other Nymphalidae species showed nearly 
identical gene orientation and arrangement, with only a few alterations in non-coding fragments. The nucleotide 
composition and codon frequency all fell into the range estimated for the order Lepidoptera. The А. ilia mitochondrial 
genome had the canonical set of 22 tRNA genes folded in the typical cloverleaf structure, with an unique exception of 
tRNA?* (AGN). The mitochondrial genes from A. ilia were overlapped in a total of 33 bp at 9 locations, as well as 
interleaved with a total of 155 bp intergenic spacers, spread over 12 regions with the size ranging from 1 to 49 bp. 
Furthermore, the spacer between ND6 and Cyt b harbored a microsatellite-like repeat (TA); not found in other 
completely sequenced nymphalid genomes. The 403 bp AT-rich region harbored two conserved motifs (ATAGA, 
ATTTA), a 21 bp polyT stretch, а 10 bp poly-A region, along with two microsatellite-like repeats ( (TA), and (TA);), as 
detected in other nymphalid butterflies. 
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摘要 : CHM RAW Apatura ilia( 鳞 翅 目 ， 峡 蝶 科 ) 的 线粒体 基因 组 全 序列 进行 了 测定 , 同时 结合 其 它 
12 (GenBank accession no.: JF43792 
-个 15 242 bp 的 环 状 DNA 分 子 , 包含 13 КАЛАА А. 2S rRNA 基因 和 22 4* tRNA 基因 。13 44 
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编码 基因 中 , 除了 COI 基因 的 起 始 密码 子 是 ССА 外 , Жах 12 个 蛋白 编码 基因 都 具有 标准 的 АТМ 起 始 密码 子 ; m 
Pio E; Hr CSS 10 Fae Е DS E У АНЕЛ 7; TA JL A Te, 只 是 在 非 编 码 序 列 上 存在 细微 的 差异 , 其 
昔 酸 的 构成 及 密码 子 使 用 频率 都 处 于 鳞 翅 目 昆 虫 的 范围 之 内 。22 个 的 tRNA 基因 中 , 除了 (АМААС fie rl 


DHU E, ERHI tRNA 基因 都 显示 为 典型 的 三 叶 草 结构 。 基 因 组 共存 在 9 处 基因 间 
个 基因 间隔 区 (总 长 为 155 bp, 最 长 间隔 是 49 bp, 最 短 的 是 1 bp)。 在 ND6 和 Cytb 间 的 间隔 区 中 还 发 现 有 (TA)» 似 
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营区 (总 长 度 为 33 bp) 以 及 12 


























































































































1 星 结 构 。 与 其 他 峡 蝶 类 相似 , 403 bp 的 AT 富 集 区 包含 有 ATAGA, ATTTA 二 个 保守 模块 (一 个 21 bp 的 poly-T, 




















-个 10 bp 的 poly-A), 以 及 二 个 似 微 卫 星 的 复 结构 ((TA)io 和 (TA)7)。 


关键 词 : 线粒体 基因 组 ; 鳞 翅 目 ; 凿 蝶 科 ; МК TA UR 
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Present butterfly taxonomy classifies the Lesser 
Purple Emperor Apatura ilia into the subfamily 
Apaturinae (Lepidoptera: Nymphalidae) (Chou, 1998). 
This butterfly species was widely distributed in the 
Palaearctic Europe and Asia. It’s name comes from the 
butterfly’s bright violet reflection under sunshine. In its 
lifecycle, larvae feed on the leaves of trees including 
trembling poplar, poplar, aspen and willow, whereas the 
adults mainly feed on tree sap and animal dejecta (Chou, 
2000). Due to habitat destruction and environmental 
degradation, A. ilia has been designated as endangered or 
under priority protection in some countries or regions, 
including Belgium(Li & Fu, 2000). 

The mitochondrial genome of insects is a circular 
and double-stranded molecule of approximately 14-20 kb 
in size with highly conserved exon arrangement covering 
a set of 37 genes, namely 13 PCGs, 22 tRNA genes, and 
2 rRNA (srRNA (12S) and IrRNA (16S)) genes (Boore, 
1999; Taanman, 1999). A mitogenome features smaller 
sizes, faster evolutionary rates, higher conservative gene 
content, maternal inheritance and little recombination 
(Brown, 1983; Avise, 1994), compared to the nucleic 
genome. Thus it has been commonly used for taxonomic 
and phylogenetic studies in many animal groups. The 
availability of complete mitogenome data of more 
species remarkably increases the accuracy and efficiency 
of a variety of research areas, such as molecular 
phylogenetics, phylogeography and taxonomy. 

The complete mitochondrial genomes of nearly 240 
insect species are available, of which only 10 are of 
Papilionoidae, despite its high biodiversity of 17 500 
species (Robbins, 1982). The Nymphalidae is the largest 
butterfly family, and relationships with other butterfly 
groups remain unclear. It is necessary to integrate 
mitogenome data in the reconstruction of Nymphalidae 
phylogenies. In this study we sequenced the entire 
mitogenome of the nymphalid A. ilia, a representative 
butterfly species of the subfamily Apaturinae and compared 
its nucleotide organization to those of other representative 
nymphalid butterfly species. Our aim is to provide 
important molecular data to clarify the phylogenetic 
relationship between A. ilia and other nymphalid butterflies. 


1 Materials and Methods 


1.1 Sample collection and DNA extraction 
Adult individuals of A. ilia were collected from 
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Mount Yandangshan, Zhejiang, China in August 2008. 
Samples were quickly preserved in 100% ethanol and at - 
20 °C until DNA extraction. Total genomic DNA was 
isolated from a single frozen butterfly using the 
proteinase-K-SiO, as follows (Hao et al, 2005). The 
thorax muscle around 5 mm? was removed into a 10 mL 
Eppendorf tube, washed twice with ddH,O, and soaked 
for 2-3 h. Incubation was done with 500 uL DNA 
solution (5 mmol/L NaCl, 0.5% SDS, 15 mmol/L EDTA, 
10 mmol/L Tris-HCl, pH 7.6) and 40 uL proteinase-K 
(20 mg/mL). The muscle was then bathed at 55 °C for 
10~12 h and centrifuged at 4 000 rpm for 2 min. Liquid 
supernatant was transferred to a new 10 mL Eppendorf 
tube with 500 uL 8 mol/L GuSCN and 40 uL 50% clean 
glass liquid mixture, bathed at 37 °C for 1~2 h, shocked 
every ten min and centrifuged at 4 000 rpm for 1 min. 
The supernatant was removed and sediments were 
cleaned twice with 75% alcohol, and once with acetone. 
The sample was dried thoroughly in a vacuum dryer at 
45 °C prior to the addition of 60 uL TE (10 mmol/L Tris- 
Cl, Immol/L EDTA, pH 8.0). The solution was later 
bathed at 56 °C for 30 min, and centrifuged at increasing 
speed untill 4 000 rpm for 1 min. The supernatant 
containing total genomic DNA was removed into a clean 
1.5 mL Eppendorf tube and preserved at -20 °C for use. 

12 Primer design, PCR amplification and DNA 

sequencing 

The universal PCR primers for short fragment 
amplifications of the STRNA, COI and Cyt b genes were 
synthesized after Simon et al (1994) and Simons & 
Weller (2001). Long primers and certain short ones for 
some genes including СОШ and ND5 were designed by 
the multiple sequence alignments of the complete 
mitochondrial genomes of all lepidopterans available 
(Tab. 1), using ClustalX 1.8 (Thompson et al, 1997) and 
Primer Premier 5.0 (Singh et al, 1998) softwares. 

Long PCRs were performed using TaKaRa LA Taq 
polymerase with the cycling parameters: initial 
denaturation for 5 min at 95 °C, followed by 30 cycles of 
95 °C for 50 sec, 47-61 °C for 50 sec, 68 °C for 2 min 
and 30 sec; and a final extension step of 68 °C for 10 min. 
The short fragments were amplified with TaKaRa Taq 
polymerase: initial denaturation for 5 min at 94 °C, 
followed by 35 cycles of 94 °C for 1 min, 45-53 °C for 1 
min, 72 °C for 2 min; and a final extension step of 72 ° 
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for 10 min. The PCR products were detected via 
electrophoresis in 1.2% agarose gel, purified using the 
3S Spin PCR Product Purification Kit and sequenced 
directly with ABI-3730 automatic DNA sequencer. 
Mitogenome sequence data have been deposited into 
GenBank under the accession number JF437925. 
1.3 Sequence analysis 

All genes and the AT-rich region of the A. ilia 
mitogenome ClustalX 1.8. The nucleotide sequences of 
protein-coding genes were translated according to the 
invertebrate mtDNA genetic code. Fifteen of the 22 
tRNA genes were identified using the software tRNA 
Scan-SE 1.21 (Lowe & Eddy, 1997) and RNAstructure 
4.3 (Mathews, 2006). The remaining 7 tRNA genes were 
drawn manually after comparison with known 
homologous regions of other lepidopteran insects. 
MEGA 5.0 software (Tamura et al, 2007) was used to 
analyze nucleotide composition and codon usage. 


2 Results 


2.1 Gennome organization 

The complete mitogenome of A. ilia is 15 242 bp in 
size. Similar to most insects, the mitogenome has a set of 
37 genes: including 13 protein-coding, 22 tRNA and 2 
rRNA genes. A large noncoding A+T-rich region was 
identified (Fig. 1). This region is of highly variable 


Apatura ilia 


Mitochondrial Genome 
15242 bp 





Fig. 1 Circular map of mitogenome of Apatura ilia 
Abbreviations for genes are as follows: COI-III refers to the cytochrome 
oxidase subunits, Cyt b refers to cytochrome b, and ND1-6, ND4L refers to 
NADH dehydrogenase components. tRNAs are denoted as one-letter 
symbol according to IUPAC-IUB single-letter amino acid codes. 


length in insects and is generally suggested to be the 
replication and transcription origin sites of the mtDNA 
double 1992). 
mitogenome of A. ilia was found to be highly similar to 


strands (Clayton, Furthermore the 
most sequenced lepidopterans in terms of gene order and 
orientation. Nine protein-coding genes were found in the 
major strand and the remaining 4 protein-coding genes in 
the minor strand along the mitogenome(Tab. 1). Besides 
those, the mitogenome of А. ilia has 9 overlapped 
sequences and 12 intergenic sequences. 

2.2 Protein-coding genes, transfer RNA genes and 

ribosomal RNA genes 

Thirteen protein-coding genes for 3 711 amino acids 
were identified in the mitochondrial genome of A. ilia 
(Tab. 2). The longest one is the COI gene with 1 533 bp 
and the shortest one is ATP8 with only 159 bp. Twelve 
protein-coding genes were initiated by conventional start 
codon ATN, while only the COI gene was tentatively 
designated to be CGA as the start codon. In A. ilia, nine 
protein-coding genes ended with TANs (7 with TAA, 2 
with TAG), while four genes ended with a single T right 
ahead of tRNA genes (Tab. 1). 

Twenty-two tRNA genes were found in the 
mitogenome (Fig. 2), all of the cloverleaf secondary 
structure except for tRNA“ (AGN) which harbors a 
simple loop in the DHU arm. Twenty-two tRNA genes 
ranged from 62 bp for tRNA“ and (ВМА (AGN) to 
71 bp for tRNA™" in length(Tab. 3). 

As in other lepidopteran species, the А. ilia 
mitogenome was found to harbor two rRNA genes, 
SrRNA (776 bp) and IrRNA (1,333 bp). They are located 
between tRNA‘ (CUA) and an A+T-rich region, 
separated by tRNA‘. 

23 A+T-rich 
overlapping sequences 

The AT-rich region of A. ilia was found to be 403 
bp in length, located between srRNA and (ВМА М“. It had 
the highest AT content (92.596) across the whole 
mitogenome (Tab. 2), typical in Nymphalidae insects 
from 89.6% (Melnitis leda) (Unpublished) to 96.3% 
(Libythea celtis) (Unpublished)..In addition, the values of 
the AT skew and GC skew for this region reached to - 
0.07 and -0.14, respectively. 


region, intergenic spacer and 


Thirteen intergenic spacer sequences were 
determined with a total length of 155 bp (Tab. 4). The 
longest two intergenic spacer sequences were both 49 bp 
long and located between tRNASm and ND2 and ND6 
and Cyt b. The shortest one was only 1 bp in size. The 
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Tab. 1 Summary of the Apatura ilia mitogenome 
Gene Direction Nucleotide no. Size Anticodon Start codon Stop codon 
tRNA F 1-68 68 CAT32-34 
ВМА! Е 69-133 65 САТ98-100 
ВМАВ!" R 131-199 69 TTG235-237 
ND2 F 249-1262 1014 ATT TAA 
tRNA"? F 1261-1328 68 TCA1292-1294 
tRNA“ R 1321-1387 67 GCA1356-1358 
tRNA™ R 1388-1452 65 GTA1419-1421 
COI F 1461-2993 1533 CGA TAA 
tRNA'"(UUR) F 2989-3055 67 TAA3019-3021 
coll F 3056-3731 676 ATG T-tRNA 
tRNAs F 3732-3802 71 CTT3762-3764 
tRNA“? F 3803-3869 67 GTC3834-3836 
ATPase8 F 3870-4028 159 ATT TAA 
ATPase6 F 4022-4675 654 ATG TAG 
COIII F 4684-5475 792 ATA TAA 
tRNAGY F 5478-5542 65 TCC5508-5510 
ND3 F 5540-5894 355 ATA T-tRNA 
tRNA^™ F 5895-5960 66 TGC5924-5926 
tRNANS F 5960-6021 62 TCG5987-5989 
tRNA^™ F 6024-6088 65 GTT6055-6057 
tRNA*"(AGN) F 6087-6148 62 GCT6108-6110 
tRNAS" F 6151-6214 64 TTC6179-6181 
tRNA R 6215-6278 64 GAA6246-6248 
ND5 R 6279-8010 1732 ATT T-tRNA 
tRNA" R 8011-8076 66 GTG8042-8044 
ND4 R 8077-9415 1339 ATG T-tRNA 
ND4L R 9417-9701 285 ATG TAG 
tRNA™ F 9714-9777 64 TGT9744-9746 
tRNA’? R 9778-9841 64 TGG9810-9812 
ND6 F 9844-10371 528 ATA TAA 
CytB F 10421-11572 1152 ATG TAA 
tRNAS"(UCN) F 11571-11636 66 GCT11601-11603 
ND1 R 11650-12591 942 ATG TAA 
tRNA'"(CUN) R 12599-12666 68 TAG12635-12637 
LrRNA R 12667-13999 1333 
tRNA R 14000-14063 64 TAC14032-14034 
SrRNA R 14064-14839 776 
D-loop 14840-15242 403 





“tRNA abbreviations follow the IUPAC-IUB three-letter code. 
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Fig. 2 Predicted secondary cloverleaf structures for the 22 tRNA genes of Apatura ilia 

The tRNAs are labeled after amino acid abbreviations. Nucleotide sequences from 5° to 3’ are indicated for tRNA“". Dashes (-) indicated Watson-Crick base- 
pairing, and centered asterisks ( * ) indicated G-U base-paring. Arms of tRNAs (clockwise from top) are the amino acid acceptor (AA) arm, TYC(T) stem, the 
anticodon (AC) arm, and the dihydrouridine (DHU) arm. 
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Tab. 2 Comparative characteristics of nymphalid mitogenomes 








en Size AT №. РСС? tRNA stRNA D-loop GenBank un 
(bp) (9€) codons? АТ (%) Size AT(%) Size AT(%) Size AT(%) Accession No. 
S. charonda 15244 79.9 3695 782 1323 844 775 850 380 918 NC014224 GenBank 
S.kuriyamaensis 15222 79.9 3588 78.3 1311 842 775 850 380 91.8 NC014223 GenBank 
C. dauidis 15267 804 3737 78.8 1337 838 773 85.9 403 92.5 HQ658143 Xia et al, 2011 
А. hyperbius 15156 808 3718 794 1330 845 778 85.2 349 954 JF439070 Wang et al, 2011 
A. ilia 15242 805 3711 78.9 1333 860 776 849 403 92.5 ЈЕ437925 This study 
A. issoria 15245 797 3728 78.0 1331 839 788 837 430 960 GQ376195 Hun et al, 2010 
H.autonoe 15489 79. 3728 76.8 1335 837 775 852 678 94.6 GQ868707 Kim et al, 2010 
M. leda 15122 798 3723 79.0 1332 840 771 85.0 317 89.6 ЈЕ905446 Unpublished 
E.mulciber 15166 813 3725 803 1314 846 776 853 399 93.5 HQ378507 Unpublished 
L. celtis 15164 812 3722 799 1335 847 775 854 328 96.3 HQ378508 Unpublished 
A. metis 15236 80.5 3704 78.9 1333 845 779 848 394 929 NC015537 GenBank 
*Termination codons were excluded in total codon count; "Protein-coding genes. 
Tab. 3 Sizes of each region of tRNAs in Apatura ilia 
M I QW C Y UP К D G A R N S E F Н T P S L v 
Region 
68 65 69 68 67 65 67 71 67 65 66 62 65 62 64 64 66 64 64 66 68 64 
Amino-acylstem 7 7 7 7 7 7 T T T 7 T 7 T 7 7 T 7 "7 7 7 T 7 
DHU stem 3 4 4 4 3 3 3 4 4! 3 3 - 4 4 4 4 4 3 4 
Anticodon stem 5 5 5 S 5 5 S s 5 5 5 5 5 5 5 5 5 5S s s$ sg s 
TYC stem 5 4 5 4 3 4 5 4 4 4 5 4 4 4d 5 3 3 4 5 5 5 4 
DHU loop 6 6 5 6 4 8 7 7 6 5 4 4 8 7 3 5 7 5 4 5 6 4 
Anticodon loop Je. ege Jp. qs 2 Ws apo pe cp Л y Cun Spo. sque. Op МТ qoo cgo e opo эў 
Variable loop 4 5 4 4 4 4 4 5 4 4 4 5 5 7 4 4 5 4 4 4 3 4 
TYC loop 5 4 7 7 10 4 5 9 6 S 5 4 3 8 4 6 6 4 3 4 8 5 


Transfer RNA genes are labeled by one-letter symbols according to IUPAC-IUB single letter amino acid codes. L, 12, S and S? indicate tRNA'*'(CUN), 
tRNA'"(UUR), МАЎ (АСМ) and tRNAS*“(UCN), respectively. Superscripts indicate number of mismatches in each region. 


mitogenome of A. ilia contains nine overlapping sequences 
ranging from | bp to 8 bp and totaling 33 bp in length. 


3 Discussion 


3.1 Gene organization and composition 

The length of the complete mitogenome of A. ilia 
falls in the known range of the lepidopteran insects from 
15 122 bp in M. leda to 16 094 bp in Agehana marho 
(Papilio maraho) (Wu et al, 2010). A. ilia demonstrated 
a common lepidopteran gene order tRNA™" followed by 
tRNA" and by tRNA°", different from those of other 
insect groups (tRNA™ followed by tRNA ©" and by 
(ЕМА М (Tab. 1). Gene order was used to explore the 
presumed independent evolutionary lepidopteran lineages 
after divergence from their common ancestors (Boore et 
al, 1998). 

The A. 
mitogenome showed considerable bias towards an A+T 


nucleotide composition of the ilia 


preference (80.5%) (Tab. 2), a common characteristic 
observed in insect mitochondrial genomes, ranging from 
69.5% to 84.9 % (Crozier & Crozier, 1993; Dotson & 
Beard, 2001). It was noted that the content of base T 
(40.7%) was slightly higher than base A (39.8%), 
resulting in an AT skewness value of -0.012. The GC 
composition (19.5%) was correspondingly lower than 
AT (Tab. 2) and the GC skewness value was -0.21. 
3.2 Protein-coding genes 

The putative start codons were found to be the same 
as in lepidopteran mtDNA (ATN codons: 3 with ATA, 6 
with ATG, 3 with ATT) (Tab. 1), except that the COI 
gene has no uniform start codon (Lessinger et al, 2000; 
Yukuhiro et al, 2002). In general, lepidopteran insects 
were quite conservative in using CGA to initiate the COI 
gene, such as in Eriogyna pyretorum (Jiang et al, 2009), 
Hyphantria cunea (Liao et al, 2010) and Adoxophyes 
honmai (Lee et al, 2006). However, there are some 
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Tab. 4 Overlapping and intergenic spacer sequences of nymphalid mitogenomes 
Region S.cha S.kur C.dau A.hyp A ili A.iss H.aut M.led A.met E.mul L.cel 
tRNAVtRNA™ - - (O)2 - 3 = - - (D9 (O)2 
tRNA" -tRNAS" (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 (0)3 
(ЕМАС? №р2 (1)87 (D87 (1)46 (1)52 (D49 (D51 (1)50 (D72 (D60 (D46 (D52 
ND2-tRNA™ (D127 (0)15 (0)2 (0)2 (002 (04 = (02 (0)2 (02 (D1 
tRNAT?4RNASS (0)8 (0)8 (О)8 (0)8 (0)8 (0)8 (0)8 (9)8 (0)8 (9)8 (О)8 
ВМАС (АМАТ - - Е (O)1 - (01 - - : (01 - 
tRNA™'-COI (0)19 (0)19 (D1 (D3 (D8 = (D8 (02 (D5 (01 (D5 
COI-tRNA“* (0)3 (0)5 z В (0)5 : : : Н - п 
(КМА СОП - - (01 - - - - - И - - 
COII-tRNAP* (0)3 (0)3 (0)35 Е В (0)35 = (0)35 - - : 
tRNAP^4RNA^?  (O)1 (0)1 (20 (О)1 Е (0)1 (01 (011 2 (0)1 (О)1 
tRNA“?-ATP8 - z E z 3 - s - = 
ATP8-ATP6 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 (0)7 
ATP6-COIII - : (O)1 (0)1 (D8 (0)1 (0)1 (0)1 (0)1 (0)1 (0)1 
COIILtRNASY (D2 (D2 (D2 (D2 (D2 (D2 (D2 (D2 (D2 (D6 (02 
tRNASY-ND3 (0)3 (0)3 - - (0)3 : (0)3 - - и - 
ND3-tRNA“* (0)2 (02 (022 (D8 - (04 > > (0)2 (0)2 (04 
tRNA^-(RNA^* (0)1 (0)1 . (02 (0)1 (02 : (01 (0)1 (0)1 А 
tRNA“®4RNA“A" - - (01 - (02 - Б (02 (02 (01 (D5 
tRNA“"-tRNAS* (07 (0)10 (0)2 (0)2 (02 (02 (0)3 (0)3 (0)2 (О)2 (О)2 
tRNAS™ (АМА 
tRNAS*4RNAS" (01 (01 (1)28 (01 (02 (01 (02 - (D2 (D8 (01 
ВМА" (АМАР (O)2 (0)2 (O)2 (D3 - (0)2 (O)2 (0)2 - (0)2 (02 
tRNAP'*- NDS (0)26 (0)26 (0)1 (О)1 - (0)1 S (0)29 - - - 
ND5-tRNA™ (D15 Z : (D15 x (D8 (0)3 Н (24 (D15 z 
(ЕМАН МР4 (0)53 (D116 (O)1 (O)1 - (D4 с : - > 
ND4-ND4L (O)1 (D14 (О)1 (02 (01 - (0)1 (D23 (D1 (0)1 (02 
ND4L-tRNA™ (D5 (D5 (D5 (D2 (D12 (02 (D2 (D2 (D12 (D2 (D2 
tRNA™-tRNA™ - - Я - > - з - 
tRNAP"-ND6 (D51 (02 (02 (04 (02 (D5 (02 (D5 (02 (02 (02 
ND6-CytB (D6 (D6 (0)1 (О)1 (049 (03 (O)1 (0)1 (1)37 (04 (0)1 
CytB-tRNAS* (02 (D355 - (0)2 (002 (092 (D8 (D8 (0)2 (02 (D2 
tRNAS*-ND1 (О)1 (О)1 (0)2 (0)2 (D13 (002 (D6 (D17 (013 (020 (D17 
ND1-tRNA‘ (D10 (01 (01 (D3 (07 z (D1 (D1 (D1 (D1 (01 
tRNA‘-IrRNA k (D11 - - В Е а 5 р : В 
IrRNA-£RNA Y” (D7 (D7 = z (O)1 2 Е А 
(ЁМАУ 5 ВМА (01 (01 И н г д - : E - Е 
таьна OOS Om Qe Qu O5 ан Qe Os Oc Qe Qu 


The 22 tRNA are denoted by three-letter symbol. Species are indicated by the initial letter of its genus and the initial three letters of its specific name (see the 


corresponding full name in Tab. 2). (O) Overlapping sequences; (I) Intergenic spacer. Empty column means neighboring genes abutting each other. 


exceptions. A previous study using transcript information 
from the cDNA sequence showed that the start codon for 
the COI gene was TCG (Serine) in dipteran insects 
(Krzywinski et al, 2006). In addition, TTAAAG has been 
previously proposed to be the start codon for the COI 
gene in Pieris rapae (Mao et al, 2010), ATTACG for 
Papilio xuthus (Feng et al, 2010), TTAG for Corean 
raphaelis (Kim et al, 2006), and TTG for Acraea issoria 


(Hu et al, 2010) and Calinaga davidis (Xia et al, 2011). 
In A. ilia, four genes ended with a single T right 
ahead of tRNA genes (Tab. 1). The single T residue 
could be completed into triplet codons by 
polyadenylation (Clary et al, 1985), and the tRNA 
secondary structure is functional to the precise cleavage 
of the mature protein-coding genes from the primary 
multicistronic transcripts (Ojala et al, 1980, 1981). 
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The AT bias of the protein-coding genes in A. ilia 
was prominent with AT content of 78.9%, the same as 
the average value of sequenced nymphalid mitogenomes 
(78.9%) (Tab. 2). Additionally, the PCG nucleotide 
frequency was T>A>G>C, displaying significant skews 
at AT (—0.15) and GC (0.02), both comparable to other 
sequenced lepidopterans (Liao et al, 2010). Examination 
of the concatenated 13 PCGs showed that the third codon 
position (91.7%) contained higher AT content than the 
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first (74.4%) and second (70.9%) positions, and this case 
is also similar to other sequenced lepidopteran species 
like Happrchia autonoe (Kim et al, 2010). As for A+T 
content among 13 PCGs, ATP8 had the highest (93.9%) 
and COI has the lowest (72.4%) values (Tab. 2). 

The A. ilia mitogenome also has an AT bias in codon 
usage. Among 3 711 codons there were 451 UUAs, 439 AUUs, 
344 UUUs, 270 AUAs, 229 AAUs and 164 UAUs. These 
codons have much higher frequencies than others (Tab. 5). 


Tab. 5 Codon usage of PCGs in the Apatura ilia mitogenome 





Codon Num. RSCU Codon Num. RSCU 
UUU(F) 344 1.81 UCU(S) 106 2.58 
UUC(F) 36 0.19 UCC(S) 9 0.22 
UUA(L) 451 5.03 UCA(S) 100 2.43 
UUG(L) 25 0.28 UCG(S) 1 0.02 
CUU(L) 38 0.42 CCU(P) 62 2.10 
CUC(L) 4 0.04 CCC(P) 18 0.61 
CUA(L) 20 0.22 CCA(P) 36 1.22 
CUG(L) 0 0.00 CCG(P) 2 0.07 
AUU(I) 439 1.88 ACU(T) 77 2.07 
AUC(I) 27 0.12 ACC(T) 11 0.30 
AUA(M) 270 1.81 ACA(T) 61 1.64 
AUG(M) 29 0.19 ACG(T) 0 0.00 
GUU(V) 64 1.94 GCU(A) 67 2.37 
GUC(V) 1 0.03 GCC(A) 8 0.28 
GUA(V) 62 1.88 GCA(A) 37 1.31 
GUG(V) 5 0.15 GCG(A) 1 0.04 





Codon Num. RSCU Codon Num. RSCU 
UAU(Y) 164 1.76 UGU(C) 30 1.76 
UAC(Y) 22 0.24 UGC(C) 4 0.24 
UAA(*) 0 0.00 UGA(W) 89 1.89 
UAG(*) 0 0.00 UGG(W) 5 0.11 
CAU(H) 56 1.65 CGU(R) 16 1.21 
CAC(H) 12 0.35 CGC(R) 2 0.15 
CAA(Q) 53 1.83 CGA(R) 30 2.26 
CAG(Q) 5 0.17 CGG(R) 5 0.38 
AAU(N) 229 1.81 AGU(S) 35 0.85 
AAC(N) 24 0.19 AGC(S) 2 0.05 
AAA(K) 93 1.79 AGA(S) 75 1.82 
AAG(K) 11 0.21 AGG(S) 1 0.02 
GAU(D) 61 1.82 GGU(G) 53 1.08 
GAC(D) 6 0.18 GGC(G) 12 0.24 
GAA(E) 66 1.81 GGA(G) 102 2.07 
GAG(E) 7 0.19 GGG(G) 30 0.61 


A total of 3 711 codons are analysed, excluding the termination codons. Num: number of each codon. RSCU: relative synonymous codon usage. *: stop codons. 


3.3 Transfer RNA genes and ribosomal RNA genes 

The ОНО arm of КМА (AGN) has only a simple 
loop, common in most insects (Hong et al, 2008; Kim et 
al, 2006; Salvato et al, 2008; Wolstenholme, 1992). 
Similar to Parnassius bremeri (Kim et al, 2009), the 
tRNAs of A. ilia harbor 7 base pairs in amino-acyl stems, 
5 base pairs in anticodon stems, and 7 base pairs in 
anticodon loop. However, the base pair numbers vary in 
other tRNA portions, especially within the ТУС loops 
(3-10 bp) (Tab. 3). 

Among the 22 tRNA genes 32 mismatched base 
pairs were found, 10 on the amino acyl stem, 9 on the 
DHU stem, 1 on the TYC stem and 9 on the anticodon 
stem. Twenty were between guanine and uracil, 
justifiable in terms of structural stableness (Topal & 
Fresco, 1976). Some unconventional mismatches were 
also observed, e.g. A-C (1), A-G (1), and U-U (8). 
Similar cases were also seen in other lepidopteran 


species. For example, C. raphaelis has 8 U-U 
mismatches in tRNAs; A. issoria exhibits a G-A and a C- 
U mismatches in the tRNA"; Н. autonoe contains a U-U 
and a A-C mismatches in tRNA‘ (UUR), Н. cunea has 
U-U mismatches іп tRNA*“", tRNA‘ (CUN), and 
tRNA‘ (UUR) (Liao et al, 2010). These mismatches 
can be corrected through RNA-editing mechanisms that 
are well known for arthropod mtDNA (Lavrov et al, 2000) 

As in other lepidopteran species, the A. ilia 
mitogenome was found to harbor two rRNA genes (776 
bp srRNA and 1 333 bp IrRNA). The AT content of 
SrRNA was 84.9%, similar to those of other lepidopteran 
insects (87.5% for Phthonandria atrilineata (Yang et al, 
2009), 82.0% for Ostrinia furnacalis (Coates et al, 2005)) 
(Tab.2). The AT content of IrRNA (85.0%) also fell into 
the range for other lepidopteran insects (85.1% for P. 
atrilineata, 81.4% for Ostrinia Lunifer (Salvato et al, 
2008)) (Tab. 2). 
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3.4 Intergenic spacer and overlapping sequence 

The spacer | of 49 bp was located between the 
tRNAS™ and ND2 genes and this spacer had the same 
size to that in O. furnicalis, Ostrinia nubilalis (Coates et 
al, 2005), B. mori and B. mandarina (Yukuhiro et al, 
2002). It has been suggested that the spacer would 
remain invariant in length in some congeneric species 
(Cameron & Whiting, 2008), however, new findings 
showed divergence among Apatura Metis (unpublished, 
NC-015537) and A. ilia (Tab. 4). The comparison of this 
spacer with the neighboring ND2 gene showed 63% 
homology. Likewise, the cases for H. autonoe (74%), P. 
bremeri (70%), P. atrilineata (70%), C. raphaelis (62%), 
A. melete (70%) and B. mori (63%) all suggest that this 
spacer sequence originated from the ND2 gene (Kim et 
al, 2009). 

Like spacer 1, spacer 2 was also 49 bp in length and 
found between ND6 and Cyt b. It is notable that a 
microsatellite-like repeat (TA); was identified within 
this region. This case is extremely rare among the known 
nymphalid mitogenomes (the other example is in A. 
metis which has (ТА), in this region). 

Spacer 3 was 13 bp and located between tRNA?* 
(CUN) and NDI. This spacer sequence held a 7-base 
motif ATACTAA. A similar motif has been identified in 
previous studies as a plausible conservancy in all 
Lepidoptera species sequenced so far (Kim et al, 2009; 
Cameron & Whiting, 2008; Liao et al, 2010; Salvato et al, 
2008). It may be functionally essential in the recognition 
of the mtDNA TERM (the transcription termination 
peptide). 
demonstrated mixed results in Nymphalidae species. For 


Recent mitogenomic sequence data has 
example, in H. autonoe a 6-bp spacer (Kim et al, 2010) 
was located between (ЕМА (CUN) and NDI, and the 
7-bp motif ATACTAA located within tRNAS (CUN), 
whereas in C. dauidis, A. hyperbius and A. issoris, the 
two genes were overlapped for 1 or 2 bp and the motif 
was located at the 3’ end of the ND1 gene. 

The 12-bp spacer 4 was located between ND4L and 
tRNA™, being longer than those of other nymphalid 
species. All remaining spacers in A. ilia are less than 10 
bp. 

We also found two overlapping sequences which 
are conservative in Lepidoptera, one was 7 bp long and 
the other 8 bp. The 7-bp sequence was located between 
ATP8 and ATP6 as ATGATAA, and the 8-bp overlap 


was located between tRNA™ and tRNA“ as 


AAGCCTTA, These two sequences have also been 
detected in other lepidopteran insects, such as P. bremeri, 
H. cunea and P. atrilineata. Based on data of the 
the two 


presently sequenced lepidopteran insects, 


overlapping sequences postulated to be 
conservative across lepidopteran insect taxa. Another 5- 
bp sequence was located between COI and tRNA‘. The 


remaining five overlapping sequences range from 1 to 3 


were 


in size. So far the 3-bp overlap between tRNA" and 
tRNA“ has been found in all sequenced Nymphalidae 


species. 
3.5 A+T-rich region 
The AT-rich region functional in mtDNA 


replication and transcription (Taanman, 1999). The 
origin of the major-strand replication was studied in the 
AT-rich region in vertebrates (Tapper & Clayton, 1981), 
followed by the detection on both strands of mtDNA in 
1987; 
In recent years 


Drosophila species (Clary & Wolstenholme, 
1980). 
investigations were expended to the replication origin of 


Fauron & Wolstenholmn, 


the minor-strand in Diptera, Lepidoptera, Coleoptera and 
Orthoptera, and the results suggest that the replication 
origin site of mtDNA minor-strand in insects is located 
before the poly-T structure, which is standing at the 3° 
end of the AT-rich region (Saito et al, 2005). 

The AT-rich region of А. ilia has some common or 
similar structural features for lepidopteran insects. It 
harbors a 21-bp poly-T stretch located 18 bp upstream 
from srRNA and preceded by a motif ATAGA. The 
poly-T stretch and the motif composed the origin site for 
the minor-strand replication, recognizable as a structural 
signal by regulating proteins (Kim et al, 2009). 
Additionally, there are two microsatellite-like repeats of 
(ТА); and (TA), which are preceded by a conserved 
motif ATTTA and located upstream of (TA);o repeat. At 
the 5’ end of this region there is a 14-bp poly-A structure, 
shortened to be 10 bp by inserting a single T base. A 
similar case was found for other nymphalid butterfly 
species; for example, the poly-A of A. issoris mtDNA 
was inserted by a guanine. Functionally, the poly-A was 
assumed to be the replication origin location of the 
mtDNA major-strand because of its connection with 
tRNA™" (Kim et al, 2009). 
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地 图 勘误 说 明 


地 图 事 关 国 家 安 人 全、 主权、 民族 尊严 和 中 华 民 族 的 根本 利益 ， 历 来 受到 国家 的 高 度 重视 。 依据 《中 


华人 民 共 和 国 地 图 编制 出 版 管理 条 例 》 和 《地 图 审核 


管理 规定 》 的 有 关 要 求 ， 中 国 地 图 全 图 中 不 得 缺少 


香港 特别 行政 区 、 澳 门 特别 行政 区 、 海 南 岛 、 台 湾 岛 、 钓 鱼 岛 、 南 海 诸 岛 、 赤 尾 屿 及 海域 国界 线 等。 本 
F) 2011 年 32 AB 181. 569 和 578 页 相应 示意 图 的 中 国 地 图 全 图 中 缺少 南海 诸 岛 、 的 鱼 岛 、 赤 尾 同 及 海 
域 国界 线 等 ， 特 此 勘误 说 明 。 和 希望 此 勘误 能 够 引起 作者 和 编者 的 足够 重视 。 
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PUKA F А KMS (UE. dE. A-BEZ ARR» —- BE 2012 年 1 月 由 中 山大 学 出 版 
社 出 版 。 书 中 全 面 讨 论 了 人 与 猿 猴 的 异同 ， 党 试 从 灵 长 类 进化 的 新 视角 探索 人 性 起 源 ， 解 释 人 类 的 共通 
性 及 其 生物 学 本 质 ， 并 为 读者 提供 理解 自身 的 新 思维 。 目 前 国内 灵 长 类 的 书籍 不 多 ， 欢 迎 读者 订阅 ! 


《 猴 、 猿 、 人 一 一 思考 人 性 的 起 源 》 简 介 : 

毫 无 疑问 ， 人 类 是 600 余 种 灵 长 类 中 的 一 种 。 解 剖 学 者 说 猿 猴 的 手掌 、 指 纹 、 大 脑 等 身体 结构 与 人 
类 似 。 行 为 生态 学 者 说 猿 猴 的 行为 和 复杂 社会 关系 与 人 类 似 。 人 类 学 者 说 猿 猴 的 工具 使 用 技能 和 文化 传 
播 可 能 接近 于 人 类 的 锥 形 阶段 。 心 理学 者 说 猿 猴 是 地 球 上 最 聪明 的 动物 。 保护 生物 学 学 者 说 有 猿 猴 生活 
的 森林 一 定 是 最 健康 的 和 森林， 保护 猿 猴 就 是 保护 森林 生态 。 医 学 工作 者 说 猿 猴 的 免疫 系统 接近 于 人 ， 是 
克服 人 类 疾病 的 最 住 模型 动物 。 分子 生 物 学 者 说 黑猩猩 属 与 人 类 基因 相似 度 高 达 98. 7%， 是 与 人 类 最 近 
缘 的 物种 。 可 以 说 ， 狼 猴 是 一 面 多 棱镜 ， 从 方方面面 映射 出 人 性 起 源 的 背景 。 

本 书 是 目前 国内 第 一 本 综合 介绍 灵 长 类 知识 的 书籍 。 书 中 全 面 讨论 了 人 与 猿 猴 的 异同 ， 尝 试 从 灵 长 
类 进化 的 新 视角 探索 人 性 起 源 ， 解 释 人 类 的 共通 性 及 其 生物 学 本 质 ， 并 为 读者 提供 理解 自身 的 新 思维 。 
第 一 章 介绍 了 人 类 的 生物 属性 ， 第 二 章 进 一 步 详 细 介 绍 了 近代 灵 长 类 学 的 诞生 与 发 展 过程 ; 从 第 三 章 到 
第 十 章 ， 分 章节 介绍 了 灵 长 类 的 起 源 与 分 类 、 和 生态、 行为、 社会 、 文 化 、 智 能 、 疾 病 与 遗传 、 生 命 伦 理 
与 保护 等 各 方面 知识 。 第 十 一 章 总 结 了 人 类 与 猿 猴 的 区 别 ， 以 及 解答 如 何 界 定 人 类 等 长 期 未 解 的 自身 谜 
Йй). 






《 猴 、 猿 、 人 一 一 思考 人 性 的 起 源 》 章 节目 录 : 
序 一 (日 本 京都 大 学 渡 边 邦 夫 ) 
序 二 (中 山大 学 麻 国庆 ) 


中 山大 学 人 文 社会 科学 出 
` b 


版 基金 资 动 


前 言 
一 思考 人 性 的 起 源 第 一 章 人 类 的 生物 属性 
hit Еи 第 二 章 ， 灵 长 类 学 的 诞生 与 发 展 
хоя 第 三 章 ， 灵 长 类 的 起 源 与 分 类 
第 四 章 灵 长 类 的 生态 
PRE 灵 长 类 的 行为 
BRE 灵 长 类 的 社会 
第 七 章 灵 长 类 的 文化 
„ж 


BNE 灵 长 类 的 智能 

第 九 章 “ 灵 长 类 的 疾病 与 遗传 
第 十 章 “ 灵 长 类 的 生命 与 保护 
第 十 一 章 ， 结语: 人 性 的 起 源 
参考 文献 

附录 
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买 ， 当 当 网 http: //product. dangdang. com/product. aspx?product..id-20727953, 
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